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Data collected in Run II of the Fermilab Tevatron are searched for indications of new 
electroweak-scale physics. Rather than focusing on particular new physics scenarios, CDF 
data are analyzed for discrepancies with the Standard Model prediction. A model-independent 
approach (Vista) considers gross features of the data, and is sensitive to new large cross- 
section physics. Further sensitivity to new physics is provided by two additional algorithms: 
a "Bump-Hunter" searches invariant mass distributions for "bumps" that could indicate res- 
onant production of new particles; and the Sleuth procedure scans for data excesses in the 
high-y^pr tails. This combined global search for new physics in 2.0 fb _1 of pp collisions at 
yfs = 1.96 TeV reveals no indication of physics beyond the Standard Model. 



1 Introduction 



The Standard Model (SM) of particle physics has been remarkably succesful in describing ob- 
served phenomena, but is generally believed to require expansion beyond the electroweak scale. 
A wide variety of possible extensions have been proposed, but no experimental evidence has yet 
been obtained. As an alternative to searching for a specific new physics scenario, the present 
analysis attempts to perform a model-independent global search of the CDF high-pr data, to 
ensure that we do not miss some clear sign of new physics that happens to be lurking in an 
obscure place in the data. 

Section [2] describes a procedure named Vista which searches for discrepancies in the bulk 
features of the high-/?T data. An algorithm to scan invariant mass distributions for "bumps" 
that could indicate the resonant production of new particles is described in Section [3j Section [J] 
describes Sleuth, a quasi-model-independent search for data excesses at high J^Pt- F urth er 
details of this global search can be found in previous publications based on 927 pb" 1 of dataff^In 
this global search, a discrepancy relative to the SM prediction is only considered to be significant 
if it corresponds to a > 5a effect. 
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Figure 1: Summary of VlSTA global comparison of data to SM prediction for populations of final states (left) 
and shapes of kinematic distributions (right) . The black line represents the theoretical expectation assuming no 

beyond-SM physics. 



2 Vista 



Vista attempts to determine if the bulk features of the CDF high-py data can be described 
in terms of the Standard Model alone. Standard criteria are used to identify electrons, muons, 
taus, photons, jets, and jets tagged as originating from a b quark. Significant missing transverse 
energy is also treated as an object. All objects are required to have pt > 17 GeV/c. Over 
four million high-p-p events are considered in this 2.0 fb -1 analysis. Events are partitioned 
into exclusive final states, labelled according to the objects in each state. Standard HEP event 
generators are used to generate events from SM processes, and the response of the CDF detector 
is simulated by a GEANT-based software package. 

A true SM prediction, however, needs in addition some numbers that can only be obtained 
from the data themselves. These parameters include theoretical k-factors for the cross-sections 
of the SM processes, and experimental efficiencies for correctly and incorrectly reconstructing 
objects in the detector. A first pass of this analysis is used to determine these correction factors 
from the data, and hence obtain a complete global SM prediction. 

The Vista global comparison first studies the number of events in each of the 399 high-pp 
final states considered. The results are shown in Figure [TJ After accounting for the trials factor 
associated with looking at so many final states, we find that no final state exhibits a significant 
population discrepancy. 

The shapes of kinematic distributions are also studied, using a Kolmogorov-Smirnov test 
to assess the agreement with the SM prediction. As shown in Figure [IJ 19,650 distributions 
are investigated, with 559 found to have a significant discrepancy. A representative example 
is shown in Figure [2 However, analysis of these distributions reveals that they probably arise 
from a difficulty modelling soft QCD jet emission in the underlying Monte Carlo events. 



3 Bump Hunter 

The Bump Hunter is an algorithm to search for narrow resonances in invariant mass distributions? 
A search window is defined based on the expected detector resolution for the combination of 
objects involved. Each invariant mass distribution is then scanned with the appropriate slid- 
ing window, looking for bumps in the data relative to the SM background (which is taken 
from the VlSTA global comparison). The significance of any bump found is determined by 




AR(j2,j3) M(j1,j2,j3,j4)(GeV) 

Figure 2: Left: One of the significant shape discrepancies seen by VlSTA, AR between the second and third 
highest pt jets in the Vista 3-jet final state. Right: Invariant mass of all four jets in the 4-jet final state, which 
contains the only significant bump found by the Bump Hunter (indicated by the dashed blue lines) . 

pseudo-experiments, and the appropriate trials factor associated with looking in many places is 
accounted for. 

The procedure scans 5036 invariant mass distributions, and only one is found to have a 
bump whose statistical significance exceeds the a priori discovery threshold. This distribution 
is shown in Figure EJ This bump arises from the same mismodelling of soft jets affecting the 
Vista shapes test. 

4 Sleuth 

The Sleuth algorithm considers a single variable, the summed scalar transverse momentum 
(J2pt) of all objects in the event. For each final state, Sleuth determines the most interesting 
region on the high tail of this distribution (defined as the region with smallest probability 
that the integrated SM background would fluctuate up to or above the number of data events 
observed). The significance of the most interesting region in each final state is evaluated via 
pseudo-experiments, and a trials factor accounts for examining many final states. 

The four most interesting final states are shown in Figure O While all of these final states 
all contain the rare signature of a same-sign electron-muon pair, none display a statistically 
significant excess. We estimate that ~ 8% of pseudo-experiments drawn from the VlSTA SM 
implementation would have produced a more significant excess in a single final state purely by 
chance fluctuations. Thus there is no discovery claim arising from Sleuth with 2.0 fb _1 . 

5 Conclusions 

CDF has performed a model-independent global search for new high-p^ physics in 2.0 fb _1 of 
data. The populations of 399 exclusive final states are compared to a Standard Model prediction, 
but no significant discrepancy is found after accounting for the trials factor associated with 
looking in many places. The shapes of 19,650 kinematic distributions are also compared to the 
Standard Model expectations, and although 559 show a significant discrepancy, this is attributed 
to difficulties modelling soft QCD jet emission in the underlying Monte Carlo prediction, rather 
than a sign of new physics. A similar explanation is found for the results of the Bump Hunter, 
an algorithm to scan invariant mass distributions for narrow bumps that could indicate resonant 
production of new particles: only one significant bump is found, and it is attributed to the same 
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CDF Run II data 
Pythia Z(-m) : 23% 
Pythia Z(->nu) : 22% 
Pythia jj : 20% 
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CDF Run II data 
Baur W(->nv)y : 33% 
Pythia Z(-inn) : 17% 
Alpgen W(->fiv) j : 9.9% 
MadEvent W(-Wv)jy : 7% 
Other 
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CDF Run II data 
Alpgen W(^^v) jjj : 1 8% 
MadEvent W(-Wv)fr : 13% 
Herwig tt : 1 3% 
MadEvent Z(->u^i) jj : 11% 
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Figure 3: The four most interesting final states found by Sleuth in 2.0 fb -1 . The label in the top left corner of 
each plot lists the objects in the final state, where r 1 is a lepton (e or //), (' is an additional lepton of different 
flavor, j denotes a jet and j/t represents missing transverse momentum. The region with the most significant 
excess of data over SM expectation is indicated by the blue line and displayed in the inset. The significance of 
the excess is shown by the value of V in the top right corner. 



underlying problem with QCD jet emission. The Sleuth algorithm searches the ~^2pt spectrum 
of each final state, but finds no significant excesses of data over SM prediction in the tails of 
any single distribution. Unfortunately, this CDF global search has not discovered new physics 
in 2.0 fb -1 . 
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